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METHODS OF MODIFYING THE PRODUCTION OF ISOPENTENYL 
PYROPHOSPHATE, DIMETHYLALLYL PYROPHOSPHATE AND/OR 

ISOPRENOIDS 

BA C KGROUND QF TH E INVENTION 
Field of the Invention 
The present invention is directed to genes encoding deoxyxylulose-5- 
phosphate synthase (dxps) t glyceraldehyde-3-phosphate dehydrogenase (gapd), 
and the lytB gene product, as well as vectors containing the same and hosts 
transformed with said vectors. The present invention also provides methods for 
modifying the production of isopentenyl pyrophosphate (IPP) and/or dimethylallyl 
pyrophosphate (DMAPP) and/or an isoprenoid compound (i.e., a compound derived 
from IPP and/or DMAPP). Additionally, the present invention provides a method 
for screening for procaryotic and eukaryotic genes encoding enzymes that 
participate in the nonmevalonate pathway leading to IPP and DMAPP. 

Background of the Invention 
A plethora of chemical compounds are produced by what are collectively 
known as the pathways of isoprenoid (a.k.a. terpene) biosynthesis. The one feature 
in common to the many isoprenoids (more than 20,000 have been identified in 
plants; Chappell, 1995) is their biosynthesis from the central metabolites and 
building blocks for all isoprenoid compounds: the 5 carbon compounds isopentenyl 
pyrophosphate (IPP) and its allylic isomer dimethylallyl pyrophosphate (DMAPP). 
The interconversion of IPP and DMAPP is reversible, and is catalyzed by the 
enzyme IPP isomerase (IPI). Using the IPP and DMAPP building blocks in various 
combinations, a modular assembly process that produces compounds of 5, 10, 15, 
20 or more carbons (in multiples of 5) allows the biosynthesis of the basic skeletons 
for the many and various isoprenoids with a relatively small number of basic 
reaction steps. The C^ skeleton of plant carotenoid pigments, for instance, is 
assembled from two molecules of a C 20 compound, geranylgeranyl pyrophosphate 
(GGPP), that is itself assembled from 3 units of IPP and 1 unit of DMAPP, and that 
also serves as a precursor for many other branches of the isoprenoid pathway in 
plants. 

Isoprenoids are formed from IPP and DMAPP in at least three different 

1 
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compartments of plants cells: the cytosol/endoplasmic reticulum, the mitochondria 
(and/or Golgi apparatus), and the plastids (McGarvey and Croteau, 1995). The 
source of IPP and DMAPP for isoprenoid biosynthesis in these different 
compartments has long been a matter of some controversy and debate (see Bach, 

5 1995; McGarvey and Croteau, 1995). The well-known "classical" or 
acetate/mevalonate route to IPP and DMAPP proceeds from acetyl-CoA via 3- 
hydroxy-3-methylglutaryl-CoA (HMG-CoA) and mevalonic acid (MVA). The critical, 
rate-determining and irreversible step in this route in animals is the reduction of 
HMG-CoA to produce MVA, catalyzed by the enzyme HMG-CoA reductase 

10 (HMGR; EC 1 .1 .1.34; reviewed in Goldstein and Brown, 1990). This enzyme also 
appears to provide an important control point for substrate flow into certain 
isoprenoids in plants (Bach, 1986; Chappell et al., 1995; Schaller et al., 1995; 
Stermer et al., 1994). HMGR has been localized in the cytosol of plants, 
associated with membranes of the endoplasmic reticulum (reviewed in Bach, 1995). 

15 There is no convincing experimental evidence that this key enzyme also resides in 
the plastids or mitochondria of plants. An uptake by plastids of IPP to support 
isoprenoid biosynthesis in this organelle has been suggested in the past (Kreuz and 
Kleinig, 1984). However, it has been difficult to reconcile a cytosolic source of IPP 
for plastid isoprenoids with the effective inhibition of cytoplasmically-made 

20 isoprenoids (e.g. sterols) but not of plastid-made isoprenoids (e.g. carotenoids) or 
mitochondrial ones (quinones) by mevinolin, a specific inhibitor of HMG-CoA 
reductase (discussed in Bach, 1995 and McGarvey and Croteau, 1995). 
Furthermore, the demonstrated ability of isolated chloroplasts of Acetabularia 
(Moore and Sheperd, 1977) and spinach (Schulze-Seibert and Schultz, 1987) to 

25 synthesize carotenoids and other isoprenoids from 14 C0 2 necessarily implies a 
chloroplast pathway for IPP biosynthesis. Further confusion arose from reports that 
radiolabelled acetate is efficiently incorporated by isolated chloroplasts into fatty 
acids but not into carotenoids (see Kleinig, 1989; Liedvogel, 1986; Schulze-Seibert 
and Schultz, 1987), a puzzling observation that has been explained by invoking a 

30 metabolic "channeling" that somehow renders the newly-formed acetyi-CoA pool 
available for fatty acid biosynthesis but unavailable for IPP biosynthesis (Kleinig, 
1989). 
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An attractive explanation for these many puzzling and apparently 
contradictory observations is provided by the suggestion of an "alternative" or non- 
mevalonate pathway for the biosynthesis of IPP and/or DMAPP in the chloroplasts 
of plants (Lichtenthaler et a!., 1997) and in algae (Schwender et al., 1996). 

5 Existence of a distinct non-mevalonate biochemical pathway to IPP and/or DMAPP, 
using pyruvate rather than acetate as a substrate, was posited for certain bacteria, 
including Escherichia coll as early as 1981 (Pandian ef a/., 1981), but this 
alternative pathway in bacteria received little attention until relatively recently (Zhou 
and White, 1991; Rohmer et al., 1993 and Horbach et al., 1993; see Bach, 1995 

10 for a critical discussion of this topic). It appears that many bacteria do not have the 
acetate/mevalonate pathway for biosynthesis of IPP. For instance, a BLAST 
search of the E. coli genome, the entire sequence for which has now been 
deposited in GenBank, does not identify any open reading frames with significant 
similarity to known bacterial, plant or mammalian HMG-CoA reductases. Nor can 

1 5 we discern an HMG-CoA reductase homologue in the genome sequence data base 
of the cyanobacterium Synechococystis 6803. Because ancestors of the 
cyanobacteria are the presumptive progenitors of plant chloroplasts, the non- 
mevalonate route to IPP in these photosynthetic procaryotes is likely to resemble 
that in plant chloroplasts. 

20 Recent reports indicate that plant and algal chloroplast isoprenoids, including 

carotenoids (Lichtenthaler et al., 1997; Schwender et al., 1996), and the medically- 
important isoprenoid paclitaxel (a.k.a. TAXOL®) (Eisenreich et al., 1996) are made 
from IPP supplied via a non-mevalonate pathway, Whether this pathway supplies 
IPP for carotenoids and other isoprenoids manufactured in the various 

25 nonphotosynthetic plant plastids (e.g. chromoplasts, leucoplasts, amyloploasts) or 
in etioplasts or developing chloroplasts is an open question. Several recent reports 
describe a thiamin-dependent enzyme from £. coli (Sprenger ef a/. r 1997; Lois ef 
ai t 1998) and from peppermint (Lange ef a/., 1998) that converts pyruvate and 
glyceraldehyde-3-phosphate (GAP) to deoxyxylulose-5-phosphate (DXP). A 

30 homologue of this enzyme was earlier described for Arabidopsis thaliana (Mandel 
et a/., 1996), but the function of the A. thaliana polypeptide had not been 
ascertained. 
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The evidence is compelling that DXP is a substrate in the pathway leading 
to IPP or DMAPP in plants, algae, and certain bacteria including E. coli (Arigoni et 
al., 1997). However, DXP also is a substrate leading to other essential compounds 
in these organisms, including thiamin and pyridoxal (see Lois et al., 1998). 
Therefore the biochemical reaction catalyzed by the DXP synthase enzyme is not 
dedicated or restricted to providing substrate for the pathway or pathways leading 
to IPP and/or DMAPP, and isoprenoids derived therefrom. Because the product 
of the reaction is shared with several other pathways, the DXP synthase enzyme 
would not seem to be an obvious candidate for a controlling step in the pathway 
leading to isoprenoids. An ability to modify isoprenoid production, enhancing 
production of desired compounds or reducing that of undesirable compounds, 
would be quite advantageous in many applications. 

Plausible routes to IPP from pyruvate and GAP via DXP have been 
suggested for plants and bacteria (Schwender et a/., 1 996; Rohmer et al, 1 993 and 
1996; see Bach, 1995). A gene encoding what may be the second enzyme in the 
pathway, DXP reductoisomerase, has recently been identified in Escherichia coli 
(Takahashi et al., 1998). The precise nature and sequence of subsequent 
biochemical reactions in the nonmevalonate pathway, and whether regulation of 
flux through this pathway occurs primarily at a single controlling step, as with HMG- 
CoA reductase for the mevalonate pathway, are so far unknown. We have taken 
a genetic approach to delineate the pathway leading to IPP and DMAPP in E. coli 
and plants. 

We earlier made the surprising observation that the reversible 
interconversion of IPP and DMAPP limits the accumulation of carotenoids in E. coli 
(an observation alluded to in Sun et al M 1996 and Sun et al., 1998). Others 
(Kajiwara et al., 1997) have made a similar observation. As a direct result of 
continuing to pursue the same empirical approach (using a colored isoprenoid 
compound to "report" on the amount of IPP and DMAPP available for isoprenoid 
biosynthesis) which revealed that the isomerization of IPP is a limiting step in 
isoprenoid biosynthesis (see U.S. Patent No. 5,744,341), we have now identified 
several other genes which can be used to modify IPP, DMAPP and/or isoprenoid 
production in host cells. 



WO 99/58649 



PCI7US99/07041 



For two of the these genes the functions of the products, glyceraldehyde-3- 
phosphate dehydrogenase and DXP synthase, are well known. The role of the 
product of the third, the lytB gene, is still obscure. Because certain mutations in 
this gene in Escherichia coli confer an ability to tolerate elevated levels of penicillin 

5 (Harkness et al., 1992; Gustafson et al., 1993) the lytB gene product would seem 
to play some role in or otherwise influence the biosynthesis of peptidoglycan. It has 
been suggested that the lytB gene product is not an enzyme directly involved in the 
pathway leading to peptidoglycan but rather exerts some control on the production 
of the global regulator molecule guanosine 3\5'-bisphosphate (ppGpp) (Gustafson 

10 et al., 1993; Rodionov and Ishiguro, 1995). The evidence in support of this 
suggestion is correlative and indirect and therefore by no means convincing. 
Because lytB influences the accumulation of certain isoprenoids in £ coli (see 
below), because an analysis of the nearly twenty completed bacterial genomes 
reveals it to be one of the very few genes found only in those bacteria that also 

15 contain genes encoding DXP synthase and DXP reductoisomerase, and because 
we have isolated an apparently chloroplast-targeted lytB cDNA from a eucaryotic 
plant, we speculate that lytB may encode an enzyme that catalyzes one of the later 
reactions in the nonmevalonate pathway leading to IPP and DMAPP in certain 
bacteria and in plant chloroplasts. To avoid confusion, it is important to note that 

20 another gene, which is completely unrelated to the gene under consideration here, 
has unfortunately also been given the name lytB (see, e.g., Garcia et al., 1999). 

There remains a need for methods to enhance the production of desirable 
isoprenoids and reduce the accumulation of undesirable isoprenoids in plants, fungi 
and bacteria that contain a nonmevalonate pathway. There also remains a need 

25 in the art for methods for screening for procaryotic and eukaryotic genes encoding 
enzymes of isoprenoid biosynthesis and metabolism. 

ftl 1MMARY OF THF INVENTION 
A subject of the present invention is an isolated nucleic acid sequence which 
encodes for a protein having DXP synthase enzyme activity, wherein the nucleic 
30 acid sequence is at least 85% identical to SEQ ID NO: 1 or the nucleic acid 
sequence encodes a protein which has an amino acid sequence which is at least 
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85% identical to SEQ ID NO: 2. 

Another subject of the present invention is an isolated nucleic acid sequence 
which encodes for a protein having GAP dehydrogenase enzyme activity, wherein 
the nucleic acid sequence is at least 85% identical to SEQ ID NO: 3 or the nucleic 
5 acid sequence encodes a protein which has an amino acid sequence which is at 
least 85% identical to SEQ ID NO: 4. 

A further subject of the present invention is an isolated nucleic acid 
sequence which encodes for a protein having LYTB activity, wherein the nucleic 
acid sequence is at least 85% identical to SEQ ID NO: 5 or the nucleic acid 
10 sequence encodes a protein which has an amino acid sequence which is at least 
85% identical to SEQ ID NO: 6. 

The invention also includes vectors which comprise any of the nucleic acid 
sequences listed above, and host cells transformed with such vectors. 

Another subject of the present invention is a method of enhancing the 
15 production of IPP, DMAPP and/or an isoprenoid compound in a host cell, 
comprising inserting into the host cell a vector comprising a heterologous nucleic 
acid sequence which encodes for a protein having DXP synthase, GAP 
dehydrogenase and/or LYTB activity, wherein the heterologous nucleic acid 
sequence is operably linked to a promoter; and expressing the heterologous nucleic 
20 acid sequence, thereby producing the protein. 

Yet another subject of the present invention is a method of modifying the 
production of IPP, DMAPP and/or an isoprenoid compound in a host cell, 
comprising inserting into the host cell a vector comprising a heterologous nucleic 
acid sequence which encodes for a protein which modifies DXP synthase, GAP 
25 dehydrogenase and/or LYTB activity in the host cell, wherein the heterologous 
nucleic acid sequence is operably linked to a promoter; and expressing the 
heterologous nucleic acid sequence, thereby producing the protein. 

The present invention also includes a method of expressing, in a host cell, 
a heterologous nucleic acid sequence which encodes for a protein having DXP 
30 synthase, GAP dehydrogenase and/or LYTB activity, the method comprising 
inserting into the host cell a vector comprising the heterologous nucleic acid 
sequence, wherein the heterologous nucleic acid sequence is operably linked to a 
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promoter; and expressing the heterologous nucleic acid sequence. 

Also included is a method of expressing, in a host cell, a heterologous 
nucleic acid sequence which encodes for a protein which modifies DXP synthase, 
GAP dehydrogenase and/or LYTB activity in the host cell, the method comprising 

5 inserting into the host cell a vector comprising the heterologous nucleic acid 
sequence, wherein the heterologous nucleic acid sequence is operably linked to a 
promoter; and expressing the heterologous nucleic acid sequence. 

Another subject of the present invention is to provide a method for screening 
for eukaryotic genes which encode enzymes involved in isoprenoid biosynthesis 

10 and metabolism. 

BRIEF DESCRIPTION OF THF DRAWINGS 
A more complete appreciation of the invention and many of the attendant 
advantages thereof will be readily obtained as the same becomes better 
understood by reference to the following detailed description when considered in 
15 connection with the accompanying drawings, wherein: 

Figure 1 is an illustration showing that IPP and DMAPP serve as the central 
metabolites leading to an immense variety of different isoprenoid compounds in 
plants, bacteria, and other organisms. The mevalonic acid pathway via acetyl-CoA 
in the cytosol of plants and in animals is well characterized. A second route from 

20 pyruvate and GAP, not yet elucidated, is thought to operate in the chloroplasts of 
plants and algae, in cyanobacteria, and in many bacteria. Later reaction steps in 
the pathway remain to be determined and are therefore denoted with question 
marks. The pathway is shown leading to a box containing IPP and DMAPP 
because it has not been established which of these two is the initial product of this 

25 pathway. Abbreviations: G3P, glyceraldehyde-3-phosphate; G3PD, GAP 
dehydrogenase; DXPS, DXP synthase; IPP, isopentenyl pyrophosphate; IPI, IPP 
isomerase; DMAPP, dimethylallyl pyrophosphate; FPP, farnesyl pyrophosphate; 
GPP geranyl pyrophosphate; GGPP, geranylgeranyl pyrophosphate. 

Figure 2 illustrates a route to IPP/DMAPP from pyruvate and glyceraldehyde-3- 
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phosphate (GAP), which is thought to operate in the chloroplasts of plants and 
algae, in cyanobacteria, and in many bacteria including £. colL Later reaction steps 
in the pathway remain to be determined and are therefore denoted with question 
marks. The pathway is shown leading to a box containing IPP and DMAPP 

5 because it has not been established which of these two is the initial product of this 
pathway. Enzymes of interest are shown in bold white text in a black box. 
Abbreviations: DMAPP, dimethylallyl pyrophosphate; DXPS, deoxyxylulose-5- 
phosphate synthase; DXPR, deoxyxylulose-5-phosphate reductoisomerase; FPS, 
famesyl pyrophosphate synthase, GAPD, glyceraldehyde-3-phosphate 

10 dehydrogenase; GGPP, geranylgeranyl pyrophosphate; IPI, isopentenyl 
pyrophosphate isomerase; IPP, isopentenyl pyrophosphate. 

Figure 3 illustrates that the C 40 carotenoid phytoene is derived by a head-to-head 
condensation of two molecules of the C 20 GGPP compound, which itself is 
assembled from 3 molecules of IPP and 1 molecule of DMAPP. Abbreviations: 
15 FPP, famesyl pyrophosphate; GPP geranyl pyrophosphate; PPPP, prephytoene 
pyrophosphate. 

Figure 4 is a schematic illustration and restriction map of plasmid pAC-LYC 
(Cunningham et al., 1994) which contains genes (crtE, crtB, and crtl) of Erwinia 
herbicola encoding all of the enzymes required for production of the pink-colored 
20 isoprenoid pigment lycopene from the colorless IPP and DMAPP compounds. Cm, 
chloramphenicol resistance gene. 

Figure 5A is a cDNA sequence and Figure 5B is the predicted amino acid sequence 
of a putative DXP synthase isolated from a flower cDNA library of Tagetes erecta 
(SEQ ID NOS: 1 and 2). The cDNA is incorporated into the plasmid pMarDXPS. 

25 Figure 6A is a cDNA sequence and Figure 6B is the predicted amino acid sequence 
of a chloroplast isoform of GAP dehydrogenase isolated from Arabidopsis thaliana 
(SEQ ID NOS: 3 and 4). The cDNA is incorporated into the plasmid pAtG3PD. 
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Figure 7A is a cDNA sequence and Figure 7B is the predicted amino acid sequence 
of a LYTB protein isolated from an Adonis palaestina flower cDNA library (SEQ ID 
NOS: 5 and 6). The sequence is of clone Ipi3 except that the 14 bp at the N- 
terminus were obtained from the slightly longer cDNA clone Ipi18. The cDNA is 
5 incorporated into the plasmid pApLYTB. 

Figure 8 is an alignment of the predicted amino acid sequences of LYTB from 
Adonis palaestina, Synechocystis PCC6803 and E coli. The N-terminal extension 
of the Adonis polypeptide, relative to that of Synechocystis PCC6803, is predicted 
by the program ChloroP (Emanuelsson et al. f 1999) to constitute a chloroplast 
10 transit peptide, serving to target the polypeptide to this organelle in plants. Black 
boxes with white letters are used where all three of the aligned residues are 
identical. Grey boxes with black letters are used where two of the three aligned 
residues are identical. Abbreviations: Ap ( Adonis palaestina] Sy, Synechocystis 
PCC6803; Ec, Escherichia coli. 

15 Figure 9 is a schematic representation of the mapping of an E coli genomic 
fragment to ascertain which of the genes in this fragment will enhance or impair 
lycopene accumulation in E coli. Deletion mapping of an E coli genomic fragment 
(the insert in plasmid pEc3.9 is essentially identical to GenBank U32768: 
4879..8819) with genes encoding DXP synthase (orf620), FPP synthase {ispA) and 

20 the small subunit of exonuclease VII (xseS). Numbers to the right indicate relative 
lycopene accumulation per mL of liquid culture with plasmids in lycopene- 
accumulating E coli strain TOP10. The insert in pEc3.9 is oriented in the forward 
direction in the multicopy plasmid vector pBluescript SK-. The EcoR\ and Smal 
sites in the vector preceding the genomic fragment and a Kpn\ site following it were 

25 used, along with Nde\ t Smal and Sail sites in the genomic DNA, to construct the 
deletion subclones illustrated. Incomplete genes and open reading frames are not 
shown. 

DESCRIPTION OF THF PREFERR FD EMBODIMENTS 
The term "isoprenoid" is intended to mean any member of the class of 

9 
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naturally occurring compounds whose carbon skeletons are composed, in part or 
entirely of isopentyl C 5 units. Preferably, the carbon skeleton is of an essential oil, 
a fragrance, a rubber, a carotenoid, or a therapeutic compound, such as paclitaxel. 

A search of the public databases (GenBank) for genes encoding 
homologues of the E. coli lytB gene product unearthed such genes in twelve 
bacteria and in the cyanobacterium Synechocystis PCC6803. A recent publication 
(Potter et aL, 1998) discusses the distribution of this gene in various bacteria. Our 
description herein of a homologue in the plant Adonis palaestina reveals for the first 
time that this gene is present and expressed in a eucaryotic organism. A recently 
deposited genomic DNA sequence for the green plant Arabidopsis thaliana 
(GenBank accession number AL035521 ) contains what appears to be a gene (the 
probable coding sequence is interrupted by several apparent introns) encoding 
LYTB in this organism. The predicted sequence of this Arabidopsis LYTB is 
somewhat uncertain (a comparison of the Adonis sequence with that listed given 
in AL035521 for Arabidopsis suggests that several of the exon-intron junctions 
predicted for this gene in the GenBank record are incorrect), but sequence identity 
in a comparison with the Adonis sequence is ca. 80% or more. Partial cDNA 
sequences in the data base of expressed sequence tags (dbEST) that predict 
peptides with sequence similarity to portions of the Adonis LYTB sequence indicate 
that homologues exist and mRNAs encoding LYTB are produced in several other 
plant species including rice (D45948), loblolly pine (AA556723) and soybean 
(AI437981). Both the Adonis and Arabidopsis predicted amino acid sequences are 
more than 60% identical to that predicted by the cyanobacterium Synechocystis 
PCC6803 gene, and the two plant and the cyanobacterial sequences are more than 
30% identical to the predicted E. coli gene product. An alignment of the Adonis, 
Synechocystis PCC6803 and E. coli predicted amino acid sequences is shown in 
Figure 7. A number of regions and residues conserved in LYTB are indicated in 
this Figure. 

The term "LYTB activity" is intended to mean the ability of LYTB to affect the 
production of IPP, DMAPP and/or isoprenoids in a host cell containing the lytB 
gene or DNA copy of the lytB mRNA. It has not yet been confirmed that the LYTB 
protein is, in fact, an enzyme. The precise role of the LYTB protein in affecting the 
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production of isoprenoids has not been established. However, whatever the 
mechanism of action of the lytB gene product, we intend to cover that mechanism 
of action by the term "LYTB activity". 

The present invention is directed to an isolated nucleic acid sequence which 
encodes for a protein having DXP synthase enzyme activity, wherein the nucleic 
acid sequence is at least 85% identical to SEQ ID NO: 1 or the nucleic acid 
sequence encodes a protein which has an amino acid sequence which is at least 
85% identical to SEQ ID NO: 2. Preferably, the nucleic acid sequence is at least 
90%, at least 95% or completely identical to SEQ ID NO: 1 , or the nucleic acid 
sequence encodes a protein which has an amino acid sequence which is at least 
90%, at least 95% or completely identical to SEQ ID NO: 2. 

Another subject of the present invention is an isolated nucleic acid sequence 
which encodes for a protein having GAP dehydrogenase enzyme activity, wherein 
the nucleic acid sequence is at least 85% identical to SEQ ID NO: 3 or the nucleic 
acid sequence encodes a protein which has an amino acid sequence which is at 
least 85% identical to SEQ ID NO: 4. Preferably, the nucleic acid sequence is at 
least 90%, at least 95% or completely identical to SEQ ID NO: 3, or the nucleic acid 
sequence encodes a protein which has an amino acid sequence which is at least 
90%, at least 95% or completely identical to SEQ ID NO: 4. 

A further subject of the present invention is an isolated nucleic acid 
sequence which encodes for a protein having LYTB activity, wherein the nucleic 
acid sequence is at least 85% identical to SEQ ID NO: 5 or the nucleic acid 
sequence encodes a protein which has an amino acid sequence which is at least 
85% identical to SEQ ID NO: 6. Preferably, the nucleic acid sequence is at least 
90%, at least 95% or completely identical to SEQ ID NO: 5, or the nucleic acid 
sequence encodes a protein which has an amino acid sequence which is at least 
90%, at least 95% or completely identical to SEQ ID NO: 6. 

In each case, sequence similarity is measured using sequence analysis 
software, for example, the Sequence Analysis software package of the Genetics 
Computer Group (University of Wisconsin Biotechnology Center, 1710 University 
Avenue, Madison, Wisconsin 53705), MEGAIign (DNAStar, Inc., 1228 S. Park St., 
Madison, Wisconsin 53715), or MacVector (Oxford Molecular Group, 2105 S. 
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Bascom Avenue, Suite 200, Campbell, California 95008). Such software matches 
similar sequences by assigning degrees of identity to various substitutions, 
deletions, and other modifications. Conservative (i.e. similar) substitutions typically 
include substitutions within the following groups: glycine and alanine; valine, 
isoleucine and leucine; aspartic acid, glutamic acid, asparagine and glutamine; 
serine and threonine; lysine and arginine; and phenylalanine and tyrosine. 
Substitutions may also be made on the basis of conserved hydrophobicity or 
hydrophilicity (see Kyte and Doolittle, 1 Mol. BioL 157: 105-132 (1982)), or on the 
basis of the ability to assume similar polypeptide secondary structure (see Chou 
and Fasman, Adv. Enzymol. 47: 45-148 (1978)). If comparison is made between 
nucleotide sequences, preferably the length of comparison sequences is at least 
50 nucleotides, more preferably at least 60 nucleotides, at least 75 nucleotides or 
at least 100 nucleotides. It is most preferred if comparison is made between the 
nucleic acid sequences encoding the protein coding regions necessary for protein 
activity. If comparison is made between amino acid sequences, preferably the 
length of comparison is at least 20 amino acids, more preferably at least 30 amino 
acids, at least 40 amino acids or at least 50 amino acids. It is most preferred if 
comparison is made between the amino acid sequences in the protein coding 
regions necessary for protein activity. 

The present inventors have isolated eukaryotic genes encoding DXP 
synthase from Tagetes erecta (marigold), GAP dehydrogenase from Arabidopsis 
thaltana, and LYTB from Adonis palaestina. All were identified on the basis of an 
enhancement of lycopene accumulation in E. coll The E. coli DXP synthase was 
also identified in this same way. 

Suitable vectors according to the present invention comprise a gene 
encoding one or more of the above-identified enzymes involved in IPP, DMAPP 
and/or isoprenoid biosynthesis or metabolism, wherein the gene is operably linked 
to a suitable promoter. Suitable promoters for the vector can be constructed using 
techniques well known in the art (see, for example, Sambrook et at., Molecular 
Cloning A Lahnratnry Manual . Cold Spring Harbor Laboratory, Cold Spring Harbor, 
NY, 1989; Ausubel et al., nurmnt Prot nmk in Modular Biology. Greene 
Publishing and Wiley Interscience, New York, 1991). Suitable vectors for 
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eukaryotic expression in plants are described in Frey et al., Plant J. (1995) 8(5):693 
and Misawa et al, 1994a. Suitable vectors for prokaryotic expression include 
pACYC184 ( pUC1 19, and pBR322 (available from New England BioLabs, Bevery, 
MA) and pTrcHis (Invitrogen) and pET28 (Novagen) and derivatives thereof. The 

5 vectors of the present invention can additionally contain regulatory elements such 
as promoters, repressors, selectable markers such as antibiotic resistance genes, 
etc., the construction of which is very well known in the art. 

One or more of the genes encoding the enzymes as described above, when 
cloned alone or in combination into a suitable expression vector, can be used to 

10 overexpress these enzymes in a plant expression system or to inhibit the 
expression of these enzymes. For example, a vector containing one or more of the 
genes of the invention may be used to increase the amount of isoprenoids in an 
organism and thereby alter the nutritional or commercial value or pharmacology of 
the organism. 

15 Therefore, the present invention includes a method of enhancing the 

production of IPP, DMAPP and/or an isoprenoid in a host cell, relative to an 
untransformed host cell, the method comprising inserting into the host cell a vector 
comprising a heterologous nucleic acid sequence which encodes for a protein 
having DXP synthase, GAP dehydrogenase and/or LYTB activity, wherein the 

20 heterologous nucleic acid sequence is operably linked to a promoter; and 
expressing the heterologous nucleic acid sequence. 

The invention also includes a method of modifying the production of IPP, 
DMAPP and/or an isoprenoid in a host cell, the method comprising inserting into 
the host cell a vector comprising a heterologous nucleic acid sequence which 

25 encodes for a protein which modifies DXP synthase, GAP dehydrogenase and/or 
LYTB activity in the host cell, relative to an untransformed host cell, wherein the 
heterologous nucleic acid sequence is operably linked to a promoter; and 
expressing the heterologous nucleic acid sequence. 

The invention further includes a method of expressing, in a host cell, a 

30 heterologous nucleic acid sequence which encodes for a protein having DXP 
synthase, GAP dehydrogenase and/or LYTB activity, the method comprising 
inserting into the host cell a vector comprising the heterologous nucleic acid 
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sequence, wherein the heterologous nucleic acid sequence is operably linked to a 
promoter, and expressing the heterologous nucleic acid sequence. 

The invention also includes a method of expressing, in a host cell, a 
heterologous nucleic acid sequence which encodes for a protein which modifies 
DXP synthase, GAP dehydrogenase and/or LYTB activity in the host cell, relative 
to an untransformed host cell, the method comprising inserting into the host cell a 
vector comprising the heterologous nucleic acid sequence, wherein the 
heterologous nucleic acid sequence is operably linked to a promoter, and 
expressing the heterologous nucleic acid sequence. 

Preferably, the isoprenoid comprises a compound derived from at least one 
member selected from the group consisting of geranyl pyrophosphate (GPP), 
farnesyl pyrophosphate (FPP) and geranylgeranyl pyrophosphate (GGPP). By the 
term "derived from", we mean that the isoprenoid contains at least one of the listed 
compounds as a building block. It is also preferable if the isoprenoid comprises at 
least one member selected from the group consisting of a diterpene, a carotenoid, 
an essential oil, a fragrance, an isoprene, a cytokinin, a rubber, a quinone, a sterol, 
a hopanoid, a triterpene, a steroid, a prenylated protein, a phytoalexin, a gibberellin, 
a tocopherol, a dolichol, a chlorophyll and a therapeutic compound. 

It is most preferred if the isoprenoid comprises at least one member selected 
from the group consisting of an essential oil, a fragrance, a rubber, a carotenoid 
and a therapeutic compound, such as paclitaxel. 

The heterologous nucleic acid sequence may originate from a eukaryotic or 
procaryotic cell. By the term "originate from", we intend to mean that the sequence 
information for the heterologous nucleic acid came from the eukaryotic or the 
procaryotic cell. However, the specific nucleic acid itself does not have to be from 
the organism. The nucleic acid may come from the organism, or it may be 
synthetically produced using recombinant nucleic acid techniques known in the art. 

Preferably, the heterologous nucleic acid sequence comprises a nucleotide 
sequence for dxps, gapd and/or lytB. It is most preferred that the heterologous 
nucleic acid sequence comprises a nucleotide sequence which encodes a dxps, 
gapd and/or a lytB gene and is at least 85% identical, preferably at least 90%, at 
least 95% or completely identical, to SEQ ID NO: 1, 3 and/or 5, respectively, or the 
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nucleic acid sequence encodes a protein which has an amino acid sequence which 
is at least 90%, at least 95% or completely identical to SEQ ID NO: 2, 4 and/or 6, 
respectively. Identity is determined as noted above. 

The term "modifying the production" in the methods of the invention means 
that the amount of target compounds produced (e.g., IPP, DMAPP and/or 
isoprenoids) can be enhanced or reduced, as compared to an untransformed host 
cell. Thus, in accordance with an embodiment of the present invention, the 
production or the biochemical activity of the target compounds (or the enzymes 
which catalyze their formation) may be reduced or inhibited by a number of different 
approaches available to those skilled in the art, including but not limited to such 
methodologies or approaches as anti-sense (e.g., Gray et al., 1992), ribozymes 
(e.g., Wegener et al. f 1994), co-suppression (e.g. Fray and Grierson, 1993), 
targeted disruption of the gene (e.g., Schaefer et al., 1997), intracellular antibodies 
(e.g., see Rondon and Marasco, 1997) or whatever other approaches rely on the 
knowledge or availability of the nucleic acid sequences of the invention, or the 
proteins encoded thereby. 

Host systems according to the present invention can comprise any organism 
that utilizes a nonmevalonate (i.e., via DXP) pathway for production of IPP and/or 
DMAPP. Organisms which produce isoprenoids using IPP and/or DMAPP derived 
from a nonmevalonate pathway include plants, algae, certain bacteria, 
cyanobacteria and other photosynthetic bacteria. Transformation of these hosts 
with vectors according to the present invention can be done using standard 
techniques. See, for example, Sambrook et al M Molecular Cloning A Laboratory 
Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 1989; Ausubel 
et al., Current Protocols in Molecular Biology . Greene Publishing and Wiley 
Interscience, New York, 1991. 

Alternatively, transgenic organisms can be constructed which include the 
nucleic acid sequences of the present invention. The incorporation of these 
sequences can allow the controlling of isoprenoid biosynthesis, content, or 
composition in the host cell. These transgenic systems can be constructed to 
incorporate sequences which allow for the overexpression of the various nucleic 
acid sequences of the present invention. Transgenic systems can also be 
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constructed which allow for the underexpression of the various nucleic acid 
sequences of the present invention. Such systems may contain anti-sense 
expression of the nucleic acid sequences of the present invention. Such anti-sense 
expression would result in the accumulation of the substrates of the enzyme 
encoded by the sense strand. 

Having generally described this invention, a further understanding can be 
obtained by reference to certain specific examples which are provided herein for 
purposes of illustration only and are not intended to be limiting unless otherwise 
specified. 

FXAMPLE 1 

isolation of plant cDN As an d bacterial genes that enhance ca rotenoid accumulation 
in FschGrichia coli 

The plasmid pAC-LYC (Cunningham et al., 1994 and 1996) contains genes 
encoding all of the enzymes required for the formation of lycopene from IPP and 
DMAPP (see Figure 3). Cells of E.coli containing the plasmid pAC-LYC accumulate 
the carotenoid lycopene and thereby form colonies on solid growth medium that are 
pink in color (Cunningham et al., 1994). When cDNA or genomic libraries of 
plants, cyanobacteria, or bacteria are introduced into the lycopene-accumulating 
£ coli strain (the complete methodology is contained in Cunningham et al., 1994 
and 1996), a rare few of the colonies formed after spreading of the cell culture onto 
solid growth medium are much deeper in color than is typical of the vast majority 
of the colonies. The enhancement of color and concomitant carotenoid pigment 
accumulation in £ coli is prima facie evidence for an enhancement of flux into and 
through the pathway or pathways leading to the immediate precursors of 
carotenoids and all other isoprenoid compounds: IPP and its allylic isomer DMAPP. 
Genes encoding subsequent enzymes in the pathway leading to lycopene are 
already present in the multicopy plasmid pAC-LYC (see Figure 3). 

Among the cDNAs and genes isolated with this screening methodology are: 
Tagetes erecta (marigold) cDNAs encoding a homologue of DXP synthase (the 
DNA sequence of the longest is given in Figure 4) t an E.coli genomic clone 
containing the DXP synthase (and no other complete open reading: a genomic 
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fragment from the Sph\ site which lies at base 5210 in the sequence listed under 
GenBank U32768 to the A/del site at base 8003 in this GenBank sequence was 
found to enhance carotenoid accumulation in £. coli. This region encompasses the 
open reading frame earlier referred to as f620, and now as the DXP synthase), 
Arabidopsis thaliana cDNAs encoding a chloroplast isoform (Shih et a/., 1991 and 
1 992) of the enzyme GAP dehydrogenase (the sequence of the longest of these 
isolated cDNAs is given in Figure 5),and an Adonis palaestina cDNA encoding a 
homologue of the £. coli lytB gene product (Figure 6). The enhancement of 
isoprenoid biosynthesis in £. coli (as indicated by lycopene accumulation) when 
cDNAs or genes encoding DXP synthase or GAP dehydrogenase are introduced 
can be understood in the context of the biochemical pathway that has been 
postulated for production of IPP/DMAPP in E. coli (and in other bacteria, 
cyanobacteria and plants). However, at the date of the provisional filing of this 
application, there had been no indications that increased flux through the pathway 
to IPP/DMAPP might be obtained by increasing the expression of either of these 
enzymes. A publication after the priority date of this application (Harker and 
Bramley (1999)) shows a salutary effect on isoprenoid accumulation in E. coli in 
which foreign DXP synthase genes are introduced. 

The mechanism of enhancement of isoprenoid production by introduction of 
cDNAs encoding LYTB is not yet known. The nonmevalonate pathway leading to 
IPP and/or DMAPP has not yet been elucidated and these cDNAs may encode an 
enzyme subsequent to DXP synthase in the pathway. Alternatively, the 
enhancement of isoprenoid accumulation may involve a mechanism less direct 
(e.g., as for GAP dehydrogenase, involvement in biochemical reactions that utilize 
or supply the substrates GAP and pyruvate, or by exerting a regulatory influence 
on isoprenoid pathways). 

EXAMPLE 2 

Arabidopsis thaliana Tagates erec fe , and Adonis palaestina cONA Libraries 

A size-fractionated 2-3 kB cDNA library of A. thaliana in lambda ZAPII 
(Kieber et al. f 1 993) was obtained from the Arabidopsis Biological Resource Center 
at The Ohio State University (stock number CD4-15). Other size fractionated 
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libraries were also obtained (stock numbers CD4-13, CD4-14, and CD4-16). The 
cDNA libraries for Tagetes erecta (marigold) and Adonis palaestina (pheasant's 
eye) were constructed for us by Stratagene using cDNAs isolated from flower 
tissues. An aliquot of each library was treated to cause a mass excision of the 
cDNAs and thereby produce a phagemid library according to the instructions 
provided by the supplier of the cloning vector (Stratagene; E. coli strain XL1-Blue 
and the helper phage R408 were used). The titre of the excised phagemid was 
determined and the library was introduced into a lycopene-accumulating strain of 
£ coli TOP10 F (this strain contained the plasmid pAC-LYC) by incubation of the 
phagemid with the E. coli cells for 1 5 min at 37°C. Cells had been grown overnight 
at 30°C in LB medium supplemented with 2% (w/v) maltose and 10 mM MgS0 4 
(final concentration), and harvested in 1.5 ml microfuge tubes at a setting of 3 on 
an Eppendorf microfuge (541 5C) for 10 min. The pellets were resuspended in 10 
mM MgS0 4 to a volume equal to one-half that of the initial culture volume. 
Transformants were spread on large (150 mm diameter) LB agar petri plates 
containing antibiotics to provide for selection of cDNA clones (ampicillin) and 
maintenance of pAC-LYC (chloramphenicol). Approximately 1 0,000 colony forming 
units were spread on each plate. Petri plates were incubated at room temperature 
for 2 to 7 days to allow maximum color development. Plates were screened visually 
with the aid of an illuminated 3x magnifier and a low power stage-dissecting 
microscope for the rare deep pink colonies that could be observed in the 
background of paler pink colonies. 



FXAMPLE 3 

Enhancement of Carotenoid Accum u lation in E. coli bv cDNAs and genes encoding 
LYTB r DXP synthase. IPP isomerase and GAP Dehydrogenase Individually and in 
Combination. 

Attempts were made to maximize production of three very different 
carotenoids in E. coli engineered to accumulate these isoprenoid pigments. The 
specific carotenoids chosen were the linear and lipophilic pink compound 
lycopene, the bicyclic hydrocarbon (i-carotene and the much more polar dihydroxy, 
bicyclic carotenoid zeaxanthin. DXP synthase, IPP isomerase, LYTB, and GAP 
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dehydrogenase had been observed to enhance the pigmentation of £. coli colonies 
growing on solid agar medium. For a more quantitative appraisal of the influence 
of these genes and cDNAs on carotenoid accumulation, the amounts of carotenoid 
pigments in liquid cultures were examined with the results given in Figure 1 . The 
marigold DXP synthase, an Arabidopsis IPP isomerase and the Adonis LYTB 
cDNAs gave rise to significant and substantial increases in the amount of 
carotenoids accumulated per volume of culture. A partially-sequenced Arabidopsis 
cDNA (GenBank AA605545) with sequence similarity to the Adonis lytB cDNA gave 
results similar to that of the Adonis cDNA (not shown). A synechocystis PCC6803 
lytB gene was less effective but gave a significant enhancement as well. See Table 
1. GAP dehydrogenase was slightly detrimental to pigment accumlation in liquid 
culture, in contrast to earlier observed enhancement for cultures grown on solid 
media. The influence of this gene on pigment accumulation may depend on the 
specific growth regimen. A similar result, enhancement for cultures on solid media 
but reduction in liquid culture, was also obtained for an £. coli genomic fragment 
containg DXP synthase (see Example 4 below and Figure 8). A combination of 
DXP synthase with IPP isomerase was significantly more effective at enhancing 
pigment accumulation than was either of the individual cDNAs. 

FXAMPLE 4 

Enhancement and Reduction of Ly nnpene Accumulation in F. coli containing 
multiple copies of genes encoding DXP synthase and Fa rn e syl Pyrophosphate 

synthase, respectiyely 

A number of the inserts in piasmids obtained from dark pink colonies 
selected in screens of the E. coli genomic library (see example 1) contained the 
gene encoding DXP synthase in this organism (see Sprenger et al., 1997; Lois et 
al., 1998). Deletion mapping of one such E coli genomic fragment indicated that 
the dark pink colony phenotype and enhanced carotenoid accumulation in liquid 
culture was conferred by a fragment containing the DXP synthase and no other 
complete open reading (Figure 8). A genomic fragment containing the ispA and 
xseB genes immediately upstream of dxps was, in contrast, quite inhibitory for 
carotenoid accumulation (Figure 8). This diminution in pigmentation results, we 
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surmise, from the activity of the farnesyl pyrophosphate (FPP) synthase encoded 
by ispA (Fujisaki et al. f 1990). The FPP synthase would be expected to compete 
with geranylgeranyl pyrophosphate (GGPP) synthase (Figure 1) for IPP and 
DMAPP and thereby deplete the substrate available for carotenoid biosynthesis. 

Table 1 illustrates the influence of cDNAs encoding DXP synthase, GAP 
dehydrogenase, LYTB, and IPP isomerase, individually and in combination, on the 
accumulation of lycppene, p-carotene, or zeaxanthin in three strains of E. coli 



engineered to produce these isoprenoid compounds. 



Table 1. Enhancement of carotenoid accumulation in lycopene, fa- 
carotene, and zeaxanthin-accumulating strains of E coli by introduction of 
plant cDNAs and cyanobacterial genes individually and in combination. 


Plasmid 


cDNA or 
Gene 
Product 


Lycopene 
% of control 


b-Carotene 
% of control 


Zeaxanthin 
% of control 


pBluescript SK- 


Control (empty 
vector) 


(100) 


(100) 


(100) 


pAtipiTrc 


Arabidopsis 
thaliana IP I 


202 ±12 


227 ±3 


223 ±4 


pTedxps 


Tagetes erecta 
DXPS 


203 ± 1 1 


193 ±11 


228 ± 40 


pAdlytB 


Adonis 

palaestina LYTB 


157 ±9 


180 ±3 


168 ±2 


p6803lytB 


Synechocystis 
PCC 6803 LYTB 


116± 11 


111 ± 1 


109 ±2 


pAtgapA 


Arabidopsis 
thaliana GAPD 


79 ±6 


nd 


79 ±3 


Data were obtained with E. coli strain TOP10 containing pAC-LYC, pAC-BETA or 
pAC-ZEAX (Cunningham et al. ? 1994 and 1996, Sun et al., 1996). Carotenoid 
content (per volume of culture) was measured after 48-72 h growth in LB medium. 
Cultures were inoculated with freshly grown individual colonies from agar plates. 
Values are mean ± sd for 4 to 13 individual cultures. The empty plasmid pBluescript 
SK* (from Stratagene) served as the control and was used as the vector for all of the 
plant cDNAs and the Synechocystis lytB gene. The influence of GAPD on 
accumulation of b-carotene was not determined (nd). 
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We claim: 

1 . A method of enhancing the production of isopentenyl pyrophosphate (IPP), 
dimethylallyl pyrophosphate (DMAPP) and/or an isoprenoid in a host cell, relative 
to an untransformed host cell the method comprising 

inserting into the host cell a vector comprising a heterologous nucleic 
acid sequence which encodes for a protein having deoxyxylulose-5-phosphate 
(DXP) synthase enzyme activity, wherein the heterologous nucleic acid sequence 
is operably linked to a promoter; and 

expressing the heterologous nucleic acid sequence, thereby 

producing the protein. 

2. The method of claim 1 , wherein the isoprenoid is derived from at least one 
member selected from the group consisting of geranyl pyrophosphate (GPP), 
farnesyl pyrophosphate (FPP) and geranyigeranyl pyrophosphate (GGPP). 

3. The method of claim 1, wherein the isoprenoid comprises at least one 
member selected from the group consisting of an essential oil, a fragrance, a 
rubber, a carotenoid and a therapeutic compound. 

4. The method of claim 1 , wherein the isoprenoid comprises a carotenoid. 

5. The method of claim 1 , wherein the heterologous nucleic acid sequence 
comprises a nucleic acid sequence for dxps. 

6. The method of claim 5, wherein the nucleic acid sequence for dxps encodes 
an amino acid sequence which is completely identical to SEQ ID NO: 2. 

7. The method of claim 1, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell, 

8. The method of claim 1, wherein the heterologous nucleic acid sequence 
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originates from a eukaryotic cell. 

9. The method of claim 1, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

10. A method of modifying the production of isopentenyl pyrophosphate (IPP), 
dimethylallyl pyrophosphate (DMAPP) and/or an isoprenoid in a host cell, the 
method comprising 

inserting into the host cell a vector comprising a heterologous nucleic 
acid sequence which encodes for a protein which modifies deoxyxylulose-5- 
phosphate (DXP) synthase enzyme activity in the host cell, relative to an 
untransformed host cell, wherein the heterologous nucleic acid sequence is 
operably linked to a promoter; and 

expressing the heterologous nucleic acid sequence, thereby 
producing the protein. 

1 1 . The method of claim 10, wherein the isoprenoid is derived from at least one 
member selected from the group consisting of geranyl pyrophosphate (GPP), 
famesyl pyrophosphate (FPP) and geranylgeranyl pyrophosphate (GGPP). 

12. The method of claim 10, wherein the isoprenoid comprises at least one 
member selected from the group consisting of an essential oil, a fragrance, a 
rubber, a carotenoid and a therapeutic compound. 

13. The method of claim 10, wherein the isoprenoid comprises a carotenoid. 

14. The method of claim 10, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

15. The method of claim 10, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 
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16. The method of claim 10, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

17. A method of expressing, in a host cell, a heterologous nucleic acid sequence 
which encodes for a protein having deoxyxylulose-5-phosphate (DXP) synthase 
enzyme activity, the method comprising inserting into the host cell a vector 
comprising the heterologous nucleic acid sequence, wherein the heterologous 
nucleic acid sequence is operably linked to a promoter, and expressing the 
heterologous nucleic acid sequence. 

18. The method of claim 17, wherein the heterologous nucleic acid sequence 
comprises a nucleic acid sequence for dxps. 

19. The method of claim 18, wherein the nucleic acid sequence for dxps 
encodes an amino acid sequence which is completely identical to SEQ ID NO: 2. 

20. The method of claim 17, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

21 . The method of claim 17, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

22. The method of claim 17, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

23. A method of expressing, in a host cell, a heterologous nucleic acid sequence 
which encodes for a protein which modifies deoxyxylulose-5-phosphate (DXP) 
synthase enzyme activity in the host cell, relative to an untransformed host cell, the 
method comprising inserting into the host cell a vector comprising the heterologous 
nucleic acid sequence, wherein the heterologous nucleic acid sequence is operably 
linked to a promoter, and expressing the heterologous nucleic acid sequence. 
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24. The method of claim 23, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

25. The method of claim 23, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

26. The method of claim 23, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

27. A method of enhancing the production of isopentenyl pyrophosphate (IPP) ( 
dimethylallyl pyrophosphate (DMAPP) and/or an isoprenoid in a host cell, relative 
to an untransformed host cell, the method comprising 

inserting into the host cell a vector comprising a heterologous nucleic acid 
sequence which encodes for a protein having glyceraldehyde-3-phosphate (GAP) 
dehydrogenase enzyme activity, wherein the heterologous nucleic acid sequence 
is operably linked to a promoter; and 

expressing the heterologous nucleic acid sequence, thereby producing the 

protein. 

28. The method of claim 27, wherein the isoprenoid is derived from at least one 
member selected from the group consisting of geranyl pyrophosphate (GPP), 
farnesyl pyrophosphate (FPP) and geranylgeranyl pyrophosphate (GGPP). 

29. The method of claim 27, wherein the isoprenoid comprises at least one 
member selected from the group consisting of an essential oil, a fragrance, a 
rubber, a carotenoid and a therapeutic compound. 

30. The method of claim 27, wherein the isoprenoid comprises a carotenoid. 

31 . The method of claim 27, wherein the heterologous nucleic acid sequence 
comprises a nucleic acid sequence for gapd. 
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32. The method of claim 31 f wherein the nucleic acid sequence encodes an 
amino acid sequence which is completely identical to SEQ ID NO: 4. 

33. The method of claim 27, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

34. The method of claim 27, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

35. The method of claim 27, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

36. A method of modifying the production of isopentenyl pyrophosphate (IPP), 
dimethylallyl pyrophosphate (DMAPP) and/or an isoprenoid in a host cell, the 
method comprising 

inserting into the host cell a vector comprising a heterologous nucleic acid 
sequence which encodes for a protein which modifies glyceraldehyde-3-phosphate 
(GAP) dehydrogenase enzyme activity in the host cell, relative to an untransformed 
host cell, wherein the heterologous nucleic acid sequence is operably linked to a 
promoter; and 

expressing the heterologous nucleic acid sequence, thereby producing the 
protein. 

37. The method of claim 36, wherein the isoprenoid is derived from at least one 
member selected from the group consisting of geranyl pyrophosphate (GPP), 
farnesyl pyrophosphate (FPP) and geranylgeranyl pyrophosphate (GGPP). 

38. The method of claim 36, wherein the isoprenoid comprises at least one 
member selected from the group consisting of an essential oil, a fragrance, a 
rubber, a carotenoid and a therapeutic compound. 

39. The method of claim 36, wherein the isoprenoid comprises a carotenoid. 
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40. The method of claim 36, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

41 . The method of claim 36, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

5 42. The method of claim 36, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

43. A method of expressing, in a host cell, a heterologous nucleic acid sequence 
which encodes for a protein having glyceraldehyde-3-phosphate (GAP) 
dehydrogenase enzyme activity, the method comprising inserting into the host cell 

10 a vector comprising the heterologous nucleic acid sequence, wherein the 
heterologous nucleic acid sequence is operably linked to a promoter, and 
expressing the heterologous nucleic acid sequence. 

44. The method of claim 43, wherein the heterologous nucleic acid sequence 
comprises a nucleic acid sequence for gapd. 

15 45. The method of claim 44, wherein the nucleic acid sequence encodes an 
amino acid sequence which is completely identical to SEQ ID NO: 4. 

46. The method of claim 43, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

47. The method of claim 43, wherein the heterologous nucleic acid sequence 
20 originates from a eukaryotic cell. 

48. The method of claim 43, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

49. A method of expressing, in a host cell, a heterologous nucleic acid sequence 
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which encodes for a protein which modifies glyceraldehyde-3-phosphate (GAP) 
dehydrogenase enzyme activity in the host cell, relative to an untransformed host 
cell, the method comprising inserting into the host cell a vector comprising the 
heterologous nucleic acid sequence, wherein the heterologous nucleic acid 
5 sequence is operably linked to a promoter, and expressing the heterologous nucleic 
acid sequence. 

50. The method of claim 49, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

51 . The method of claim 49, wherein the heterologous nucleic acid sequence 
10 originates from a eukaryotic cell. 

52. The method of claim 49, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

53. A method of enhancing the production of isopentenyl pyrophosphate (IPP), 
dimethylallyl pyrophosphate (DMAPP) and/or an isoprenoid in a host cell, relative 

15 to an untransformed host cell, the method comprising 

inserting into the host cell a vector comprising a heterologous nucleic acid 
sequence which encodes for a protein having LYTB activity, wherein the 
heterologous nucleic acid sequence is operably linked to a promoter; and 

expressing the heterologous nucleic acid sequence, thereby producing the 
20 protein. 

54. The method of claim 53, wherein the isoprenoid is derived from at least one 
member selected from the group consisting of geranyl pyrophosphate (GPP), 
famesyl pyrophosphate (FPP) and geranylgeranyl pyrophosphate (GGPP). 

55. The method of claim 53, wherein the isoprenoid comprises at least one 
25 member selected from the group consisting of an essential oil, a fragrance, a 

rubber, a carotenoid and a therapeutic compound. 
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56. The method of claim 53, wherein the isoprenoid comprises a carotenoid. 

57. The method of claim 53, wherein the heterologous nucleic acid sequence 
comprises a nucleic acid sequence for lytB. 

58. The method of claim 57, wherein the nucleic acid sequence encodes an 
5 amino acid sequence which is completely identical to SEQ ID NO: 6. 

59. The method of claim 53, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

60. The method of claim 53, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

10 61. The method of claim 53, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

62. A method of modifying the production of isopentenyl pyrophosphate (IPP), 
dimethylallyl pyrophosphate (DMAPP) and/or an isoprenoid in a host cell, the 
method comprising 

15 inserting into the host cell a vector comprising a heterologous nucleic acid 

sequence which encodes for a protein which modifies LYTB activity in the host cell, 
relative to an untransformed host cell, wherein the heterologous nucleic acid 
sequence is operably linked to a promoter; and 

expressing the heterologous nucleic acid sequence, thereby producing the 

20 protein. 

63. The method of claim 62, wherein the isoprenoid is derived from at least one 
member selected from the group consisting of geranyl pyrophosphate (GPP), 
famesyl pyrophosphate (FPP) and geranylgeranyl pyrophosphate (GGPP). 

64. The method of claim 62, wherein the isoprenoid comprises at least one 
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member selected from the group consisting of an essential oil, a fragrance, a 
rubber, a carotenoid and a therapeutic compound. 

65. The method of claim 62, wherein the isoprenoid comprises a carotenoid. 

66. The method of claim 62, wherein the host cell is selected from the group 
5 consisting of a bacterial cell, an algal cell and a plant cell. 

67. The method of claim 62, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

68. The method of claim 62, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

10 69. A method of expressing, in a host cell, a heterologous nucleic acid sequence 
which encodes for a protein having LYTB activity, the method comprising inserting 
into the host cell a vector comprising the heterologous nucleic acid sequence, 
wherein the heterologous nucleic acid sequence is operably linked to a promoter, 
and expressing the heterologous nucleic acid sequence. 

15 70. The method of claim 69, wherein the heterologous nucleic acid sequence 
comprises a nucleic acid sequence for lytB. 

71. The method of claim 69, wherein the nucleic acid sequence encodes an 
amino acid sequence which is completely identical to SEQ ID NO: 6. 

72. The method of claim 69, wherein the host cell is selected from the group 
20 consisting of a bacterial cell, an algal cell and a plant cell. 

73. The method of claim 69, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 
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74. The method of claim 69, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

75. A method of expressing, in a host cell, a heterologous nucleic acid sequence 
which encodes for a protein which modifies LYTB activity in the host cell, relative 
to an untransformed host cell, the method comprising inserting into the host cell a 
vector comprising the heterologous nucleic acid sequence, wherein the 
heterologous nucleic acid sequence is operably linked to a promoter, and 
expressing the heterologous nucleic acid sequence. 

76. The method of claim 75, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

77. The method of claim 75, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

78. The method of claim 75, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 
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Figure 5A: SEQ ID NO: 1 

Tagetes erecta (marigold) cDNA for DXP synthase 

1 ggcacgagaa gcactctcat attctctrgt gtgttttctg aatcacacca 

51 acaaatggct ttgtgtggtg ctttgaaggg tgggtttgtg cccatagctc 

101 aaaatggtta cacttcctca agtttgetta atccgtcagc caatgcaatc 

151 atgccatcaa acaagcgaaa gtttttggga atcgtagctg tttcaaaaga 

201 acatgcaacc aatgaacatg aggacctgac aacaatggat aagactacat 

251 caaccaccct caagtattct ggggataaac ccaaaactcc gatcttggat 

301 acgattaatt atccaattca cacgaaaaat ctttgcgtgg aggagctcgt 

351 aaaattagct gatgaactcc gagaagaaat tgtatatacg gtatcaaaga 

401 caggcggtca cttgagttcg agcttggggg ttgttgaatt aacagtttca 

451 ctccatcatg ttttcaacac tccagaagac aagatcatct gggacgttgg 

501 tcatcaggct tatccacata aaatattgac cggaagaagg tcgagaatga 

551 gaacgat"=g acaaacgttt ggacttgcag gcttccctaa gagagacgag 

601 agtaatcatg atgcttctgg tgcaggccac agctctacta gcatttctgc 

ff51 aggcttagga atggcrgtrg gtagagactc grtaggaaac aacaatcatg 

701 taattgcagt gatcggccac ggagcgatca ccgcaggaca agcatacgag 

751 gccatgaaca atgcacgrra ccnagatrcr aatctrcatca tagccttaaa 

801 tgacaacagg cagcrctczr tgcccacagc cac-attgat ggrcctgcgc 

851 cgcctgctgg cgcactcagc ccatccctca caaggcttca aacaagccaa 

901 aagtttcgcc aactacgtga agccgcaaac gaagtaacaa agcaattagg 

951 ggataaaaca catgaagczg cggcaaaaar ggactctcta gtgaaaggaa 

10 01 tggttggtgg tcaaggggct tccatgtttg aagagcttgg cctctattat 

1051 gttggtccgg tagatggaca caatcttgaa gatcttgtct atgctctcga 

1101 taagattaag tccatgacgg ctccgggacc agtcctagtc cacattgtta 

1151 ccgaaaaagg caaaggtzac ccgcccgctg aagttgctgc agataaaatg 

1201 catggtgttg tgaaatttga tactcaaacc ggcaaacaaa agaaaaataa 

1251 gaccaaaaca ctctcatata cccaatattt cgtggactct ttggttgctg 

13 01 aagcgaaaga ggatgacaag atagtcgcga ttcatgcggc tatgggcgga 

1351 ggcacaggcc ttaacacact ccaaaaagag ttecctgcac gttgttttga 

1401 tgttggcata gctgagcaac acgcaatcac ccctgctgca ggtttagcaa 

1451 cagaagggct gaaaccatct tgtgctattrt attcatcttt cctacaaaga 

1501 ggttatgatc aagtggttca tgatgttgat cttcaaaagc ttcctgttag 

1551 atttgctatg gatagggctg gtctggtagg agcagatggt ccaactcact 

1601 gtggtgcatt tgatacaaca ttcatggctt gttraccaaa catggtggcc 

1S51 atggctcctt catgcgaggc agagctcatg aacatggtgg ccacagctgt 

1701 agccatcgat gaccgtccta gctgttttag gtacccaaga ggaaatggaa 

1751 taggctccat accaccagca aataacaaag gaaccttaat cgaggttgga 

1B01 actggaaggg tgattaaaga gggaaataga gtggccttat tgggatatgg 
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FIGURE 5 A (cont.) 
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Figure 5B: SEQ ID NO: 2 

PREDICTED AMINO ACID SEQUENCE of Tagetes erecta (marigold) cDNA for 
DXP synthase 

TRANSLATION from: 55 to: 2232 Length: 726 
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Figure 6A: SEQ ID NO: 3 

Arabidopsis thaliana cDNA for a chloroplasts isoform of glyceraldehyde-3- 
phosphate (GAP) dehydrogenase. 
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Figure 6B: SEQ ID NO: 4 3 

Predicted amino acid sequence of Arabidopsis thaliana cDNA for a chloroplasts 
isoform of glyceraldehyde-3-phosphate (GAP) dehydrogenase. 



1 VPKGFTEFSG LRSSSASLPF GKKLSSDEFV SIVSFQTSAM GSSGGYRKGV 

51 TEAKLKVAIN GFGRIGRNFL RCWHGRJODSP LDIIAINDTG GVKQASHLLK 

101 YDSTLGIFDA DVXPSGETAI SVDGKIIQW SNROTSLLPW KELGIDIVIE 

151 GTGVFVDREG AGKHIEAGAK KVTITAFGXG DIPTYWGVN ADAYSHDEPI 

201 ISNASCTTNC LAPFVKVLDQ KFGIIKGTMT TTHSYTGDQR LLDASHRDLR 

251 RABAAALNIV PTSTGAAKAV ALVLPNLKGK LNGIALRVPT PNVSWDLW 

301 QVSKKTFAEE VNAAFRDSAE KELKGILDVC DEPLVSVDFR CSDFSTTIDS 

351 SLTMVMGDDM VKVIAWYDNE WGYSQRWDL ADIVANNWK* FHQTLFHVFP 

401 FFSLSLFYIM VLKISVKTV 
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Figure 7A: SEQ ID NO: 5 10/13 
Adonis palaestina cDNA 
Length: 1619 June 4, 1998 
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Figure 7B: SEQ ID NO: 6 11/13 

Predicted amino acid sequence of Adonis palaestina LYTB 

Length: 465 Translate 27-1421 
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FIGURE 8 
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FIGURE 9 
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